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Tongue diagnosis is a unique method in traditional Chinese medicine (TCM). This is the first investigation 
on the association between traditional tongue diagnosis and the tongue coating microbiome using 
next-generation sequencing. The study included 19 gastritis patients with a typical white-greasy or 
yellow-dense tongue coating corresponding to TCM Cold or Hot Syndrome respectively, as well as eight 
healthy volunteers. An Illumina paired-end, double-barcode 16S rRNA sequencing protocol was designed to 
profile the tongue-coating microbiome, from which approximately 3.7 million V6 tags for each sample were 
obtained. We identified 123 and 258 species-level OTUs that were enriched in patients with Cold/Hot 
Syndromes, respectively, representing "Cold Microbiota" and "Hot Microbiota". We further constructed the 
tongue microbiota-imbalanced networks associated with Cold/Hot Syndromes. The results reveal an 
important connection between the tongue-coating microbiome and traditional tongue diagnosis, and 
illustrate the potential of the tongue-coating microbiome as a novel holistic biomarker for characterizing 
patient subtypes. 



Tongue diagnosis is a non- invasive, simple and valuable diagnostic tool that has been repeatedly testified by 
clinical practitioners of traditional Chinese medicine (TCM) for at least 3,000 years (Fig. 1 A). TCM treats 
tongue appearance as an outer manifestation of the status of the human body. TCM practitioners differ- 
entiate conditions of individual patients according to the TCM Syndromes ("ZHENG" in Chinese), which are 
used as a holistic summary of the patient's status and are determined mainly by the colour and texture of the 
coating of the patient's tongue in addition to other symptoms 1 . The Cold Syndrome and Hot Syndrome are the 
two most common and representative syndromes that represent two opposite but interrelated conditions of the 
human body and are identified by a "white-greasy" and "yellow-dense" coating of the tongue, respectively. These 
syndromes have been used to characterise patient status in many diseases or conditions, including inflammation, 
infection, and stress, as well as immune and endocrine disorders 2 ' 3 . Cold and Hot Syndromes have also been a 
focus in our previously studies 4,5 . In addition to the extensive practice of tongue diagnosis in TCM, modern 
technologies, especially tongue imaging analyses, have been introduced for tongue diagnosis 6,7 . The relationships 
between tongue diagnosis and diseases including cancer 8 , gastritis and precancerous lesions 9,10 have been 
explored. However, the biological bases of different tongue-coating appearances are still poorly understood 
and lack systematic investigation at the molecular level. 

In modern Western medicine, the tongue has increasingly been viewed as an extension of the upper gastro- 
intestinal tract that can provide important clues to the body's current condition and global information about the 
body's health status. Some Western medicine literature has used the term "geographic tongue" to refer to tongues 
with discoloured regions or cracks that accompany illness or environmental sensitivity 11,12 . Some researchers have 
also considered TCM tongue diagnosis to be a helpful approach for clinical decision making in Western medi- 
cine 13 . For example, tongue coating can be a risk indicator for aspiration pneumonia in edentate patients, as it is 
associated with a number of viable salivary bacteria 14 . It has also been reported that amyloidosis of the tongue may 
be a diagnostic manifestation of plasmacytoma 15 . These examples indicate the potential for inferring systemic 
disorders from the tongue in clinical diagnoses. Compared with other diagnostic regimens, such as blood tests or 
tissue biopsies, tongue diagnosis is more amenable to and convenient for patients as well as doctors. Therefore, 
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Figure 1 | TCM tongue diagnosis and tongue-coating appearance classification. (A) An ancient instruction for tongue diagnosis recorded in Shang- 
han-dian-dian-jin, a TCM book compiled in the Ming Dynasty of China (1368-1644 AD). (B) Symptoms all over human body used as features for the 
TCM classification of Cold and Hot Syndromes; the tongue-coating feature is highly important in clinical practice. (C) Sampling images of tongue 
coating from the centre of the tongue, an area regarded as reflecting conditions of the "spleen-stomach" in the traditional tongue diagnosis. (D) Typical 
tongue-coating appearances: normal tongue coating of healthy control examples (upper), white-greasy tongue coating of Cold Syndrome examples 
(middle), and yellow-dense tongue coating of Hot Syndrome examples (lower). (E) Principal Component Analysis (PCA) of the tongue-coating images. 
The healthy controls, Cold Syndrome patients and Hot Syndrome patients are distributed in separate regions, indicating that the three classes are 
distinguishable based on their tongue-coating images. 



studying the molecular bases of tongue diagnosis is important for 
understanding this long-standing medical practice and might be a 
promising contribution to personalized medicine. 

Recent studies have indicated that the microbial community in the 
human body is associated with human physiology and pathology 16 . 
The microbiome on the tongue coating is one of the main micro- 
biomes of the human body 17 and is at the forefront of the human 
digestive system 18 . It was reported that the tongue shares consid- 
erable similarity with the gut not only in mucosal epithelia but also 
microbial diversity 19,20 . Densely populated by a variety of microor- 
ganism communities, the human gut microbiome is considered 
essential to maintaining natural host-environment interactions 
involved in nutrient absorption, epithelium regeneration, energy 



metabolism and immune response 21 . The characteristics and struc- 
tures of microbiomes are closely associated with the health of the 
gastrointestinal tract 22,23,24 . Patients with inflammatory bowel dis- 
ease, obesity, diabetes and colorectal cancer have been found to 
exhibit different gastrointestinal microbial structural patterns 25-28 . 

The characterization and differentiation of Cold and Hot 
Syndromes has played an important role in the clinical practice of 
TCM for gastrointestinal diseases, and the appearance of the tongue 
coating has long been a major parameter in discriminating the Cold 
and Hot Syndromes. We speculated that the microbiome composi- 
tion is a key factor that affects the appearance of the tongue coating 
and reasoned that the microbiome composition on the tongue coat- 
ing could be associated with characteristics of stomach diseases. If 
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Table 1 General 


information of patients and healthy controls 
Normal controls 


• • • • l r U c J 
Oastritis patients with Cola Syndrome 


• • • • L u C J 

Castritis patients with not Syndrome 


Number of Cases 


8 


9 


10 


Age (Mean±SD) 


26± 1 .2 


60±4.3 


53±4.2 


Male (%) 


3 (37.5%) 


4 (44.4%) 


4 (40%) 


Female (%) 


5 (62.5%) 


5 (55.6%) 


6 (60%) 



this is true, then the underlying association of the Cold and Hot 
Syndromes defined by tongue coating features may be rooted in 
the association of microbiome features with the health status of the 
human body. Therefore, we investigated the possible relationship 
between the tongue microbial community and the health status of 
the patient as described in TCM practice to begin to bridge the gap 
between traditional tongue diagnosis and molecular systems biology. 

The development of next- generation sequencing (NGS) tech- 
nology for characterising microbial communities provides an 
opportunity to deepen our understanding of the tongue-coating 
microbiome; thus, we used NGS to profile tongue-coating micro- 
biomes. We collected white-greasy and yellow-dense tongue-coating 
samples from Cold and Hot Syndrome patients with chronic 
atrophic gastritis (CAG), respectively, along with normal tongue- 
coating samples from healthy controls. The tongue-coating micro- 
biomes were sequenced using an accurate Illumina gaired-end with 
double-barcode (Ipedb for short) sequencing protocol that we 
developed. The protocol takes advantage of the sequencer's high- 
throughput capability and uses a double-barcode, paired-end strat- 
egy to ensure the accuracy of measuring 16S rRNA hyper- variable 
regions with short-read sequencing. To the best of our knowledge, 
this is the first analysis of tongue-coating microbiomes and their 
relation with traditional tongue diagnosis. As a result, we identified 
715 differentially abundant, species-level Operational Taxonomic 
Units (OTUs) on tongue coatings of the enrolled patients compared 
to healthy controls. We further identified two subtypes of tongue- 
coating microbiomes and detected 123 "Cold OTUs" and 258 "Hot 
OTUs" that are enriched in patients with Cold Syndrome and 
Hot Syndrome representing the "Cold Microbiota" and "Hot 
Microbiota", respectively. These observations revealed that the bio- 
logical bases underlying traditional tongue diagnosis may root in the 
relationship between the tongue-coating microbiome and the body's 
health status. 

Results 

Tongue-coating sample collection. We collected samples from 19 
CAG patients who were independently diagnosed by Western 
medical gastroenterologists according to "the Updated Sydney 
System" 2930 and by two TCM doctors according to the "National 
TCM Diagnosis Principle" 2 . According to Western medicine, these 
patients are considered to have the same disease, whereas in the view 
of TCM, they are further categorised into two subtypes, the Cold 
Syndrome (n=9) and Hot Syndrome (n=10), based on an overall 
medical examination of symptoms (Fig. IB). Despite the common 
manifestations experienced by the patients, such as abdominal 
discomfort or pain, nausea, vomiting and diarrhoea, patients with 
the Cold and Hot Syndromes exhibited characteristic differences. 
Tongue-coating appearance is one of the most important features 
in Syndrome classification. Fig. 1C shows the method we used to 
capture tongue-coating images according to instructions by TCM 
practitioners. We also recruited 8 volunteers who had reported no 
stomach discomforts in the past five years and showed normal 
tongue coating; these volunteers were the healthy controls in the 
study. For the selection of samples in this pilot study, we focused 
on the comparability of characteristic tongue appearances diagnosed 
by traditional Chinese medical doctors. The healthy volunteers with 
typical healthy white-thin tongue coatings were found to be younger 



than the study patients and matched the genders of the study groups, 
resulting in a group of sex-matched but age-unmatched normal 
controls. The genders and ages of gastritis patients with Cold Syn- 
drome and Hot Syndrome were well matched (Table 1). Repre- 
sentative tongue-coating images from patients with Cold and Hot 
Syndromes and from normal controls are shown in Fig. ID. Tongue- 
coating images of all the collected samples are provided in the 
Supplementary Materials (Fig. SI). Consistent with the traditional 
medical findings, the Cold Syndrome patients presented with the 
typical white-greasy tongue coatings, and Hot Syndrome patients 
exhibited the typical yellow-dense tongue coatings. In addition to 
the tongue-coating image features, the 9 Cold Syndrome patients 
presented with other typical features of Cold Syndrome according 
to their clinical records as follows: all patients reported that they 
preferred warm or hot food and drinks, and 7 reported abdominal 
distension that was relieved by using a warm compress or consuming 
hot drinks. Similarly, the 10 Hot Syndrome patients showed repre- 
sentative features in addition to the tongue-coating appearance as 
follows: 7 patients reported halitosis, and 6 reported yellow urine. 

We applied the Principal Component Analysis (PCA) to digital 
features of the tongue-coating images (see Methods). Fig. IE shows 
the distribution of samples in the space of the first two principal 
components of the PCA of the tongue-coating images. The first 
two principal components accounted for 67.5% of the total varia- 
tions. The samples were distributed in three regions correspond- 
ing to a white-greasy tongue coating in Cold Syndrome patients, a 
yellow-dense tongue coating in Hot Syndrome patients and a normal 
(white-thin) tongue coating in healthy controls (Fig. IE). This find- 
ing confirms the empirical judgments by TCM doctors using their 
naked eyes. We then collected microbiome samples from the tongue 
coatings of the study patients and normal controls (see Methods). 

16S rRNA sequencing analysis and taxonomy of the tongue- 
coating microbiome. We characterized the diversity of the 
tongue-coating microbiome via targeted 16S rRNA gene 
sequencing. We developed an Illumina paired-end with double 
barcodes (called Ipedb for short) sequencing protocol for this 
purpose (Fig. S2) and obtained 3.65 ±0.52 million high-quality V6 
(the 6th hyper-variable region in the 16S rRNA gene) sequences per 
sample from 221.33 million paired-end (PE) raw reads (Table SI). 
The details of the protocol are described in the Methods section. In 
brief, bacterial V6 regions were PCR-amplified with designed 
primers (Table S2), and PCR amplicons (120—150 bp) were 
barcoded at both ends and sequenced using the Illumina GAIIx 
with 120X2-bp PE reads. High-quality V6 sequences were yielded 
after overlapping each pair of raw PE reads; trimming barcodes, 
linkers and primers; and filtering low-quality or chimeric 
sequences (Fig. 2A). In the Ipedb protocol, the criterion for 
removing reads with mismatch(s) within the 90~120-bp 
overlapping region was crucial for reducing sequencing errors 
because singled-end reads demonstrated a high sequencing error 
rate in this region. The average sequencing error rate in the 
resulting V6 sequences was estimated at —0.22%, which was much 
less than the rate of 0.6— 1 .0% reported for Illumina GA 31 and the rate 
of 0.50% reported for pyrosequencing 32 . Additionally, our unique 
"double-barcode" strategy of adding barcodes at both ends enabled 
the identification of approximately 0.20% of the sequences as 
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Figure 2 | 16S rRNA sequencing analysis and taxonomy classification of the tongue-coating microbiome at the phylum level. (A) The flowchart of 
sequence preprocessing, quality control and chimera detection. (B) Relative abundances of the dominant phyla in samples. (C) Relative abundances of the 
rare phyla in samples. 
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chimeras generated during Illumina image processing, further 
increasing the sequencing accuracy. 

These sequences were first assigned to specific taxonomies using 
the GAST (Global Alignment for Sequencing Taxonomy) rRNA 
classifier 33 , which uses a reference database of SSU sequences to 
determine the taxonomy of hyper-variable-region tags. As shown 
in Fig. 2B and Fig. 2C, 95.23% of the sequences were classified into 
29 phyla in the Bacteria domain. The dominant phyla (relative abun- 
dance >1%) across all 27 samples were Firmicutes (30.40%±9.47%), 
Bacteroidetes (23.57%±7.57%), Proteobacteria (20.07%± 15.51%), 
Actinobacteria (15.79%±7.84%) and Fusobacteria (4.37%±2.02%). 
The rare phyla (relative abundance < 1%) were Acidobacteria, Aqui- 
ficae, Acidobacteria, Chlamydiae, Chlorobi, Chloroflexi, Cyanobac- 
teria, Gemmatimonadetes, Nitrospira, Planctomycetes, Spirochaetes, 
Synergistetes, Tenericutes, Thermi, Thermotogae, Verrucomicrobia 
and nine candidate divisions (NKB19, OP11, OP3, SPAM, SRI, 
TM6, TM7, WPS-2, ZB2). 

A comparison of the relative abundance of the dominant phyla 
between the normal controls and all gastritis patients showed no 
significant difference between the two groups. The phylum that 
showed the largest difference was Bacteroidetes, which was more 
abundant in the tongue-coating microbiomes of patients, although 
the difference was not significant (controls: 20.08% ±5.86%; patients: 
25.03% ±7.86%; two-tailed t-test, p = 0.44 after a Bonferroni correc- 
tion). The proportions of unclassifiable sequences also showed no 
significant differences between patients and controls (controls: 
2.99%±2.30%; patients: 5.50%±4.97%; two-tailed t-test, p = 0.085). 
Among the patient samples, Cold Syndrome patients had slightly 
more sequences that were not classified to specific phyla than Hot 
Syndrome patients (Cold Syndrome patients: 7.95%±6.31%; Hot 
Syndrome patients: 3.30%±1.58%; two-tailed t-test, p = 0.061), 
whereas no dominant phyla showed significant differences in relative 
abundance between the patients with the two Syndromes. 

We further conducted alpha diversity and beta diversity analyses 
to identify features of the tongue-coating microbiome composition 
that might be associated with the condition of patients with chronic 
atrophic gastritis, categorised as Hot or Cold Syndromes. V6 tag 
sequences were clustered into OTUs, which shared different levels 
of genetic similarities (unique sequence, 100%; species level, 97%; 
genus level, 95%; family level, 90%), using the CROP method 34 , an 
unsupervised Bayesian clustering algorithm. For each sample, a rar- 
efaction curve was drawn at a specific OTU level as the function of 
the observed number of OTUs on sequence counts by re-sampling at 
different sequencing depths (Figs. 3A and S3). Richness, evenness 
and phylogenetic diversity estimators were also calculated (see Figs. 
S4, S5 and S6 in the Supplementary Materials). The alpha diversities 
were consistent among tongue-coating microbial communities of 
different samples. 

We then compared the microbial community composition (based 
on presence / absence information of OTUs) and structure (based on 
relative abundances of OTUs) between samples by calculating the 
Jaccard distance 35 and the Bray-Curtis distance 36 , respectively. We 
applied the AMOVA (Analysis of Molecular Variances) method 37 
(see details in the Supplementary Materials) to determine whether 
there were stratifications in the samples. With the distances based on 
species-level OTUs, AMOVA showed that microbial communities 
on the tongue coatings of study patients and normal controls exhib- 
ited significantly different community memberships (P=0.005), but 
their community structures did not show significant difference 
(P = 0.364). We also used the Unifrac distance based on the compar- 
ison of phylogenetic trees 38 in the AMOVA analysis. With the 
unweighted Unifrac distance, a noticeable difference was observed 
between the two groups, although it was not statistically significant 
(P = 0.063). This finding indicates that there can be differences in the 
OTU compositions of tongue-coating microbiomes between differ- 
ent types of samples. We used the Principal Coordination Analysis 



(PCoA) method 39 to visualise such differences. PCoA is a variation of 
the MDS (Multi-Dimensional Scaling) method that visualises a set of 
samples on a two-dimensional plot by keeping the relative dissim- 
ilarity relations of the samples defined by a matrix of dissimilarity 
measures or distances between all pairs of samples. Fig. 3B shows the 
PCoA plot of the 27 samples when the Jaccard distance measure was 
used. Although there was not a clear boundary between the patients 
and normal control samples, there was a tendency for the normal 
controls, Hot Syndrome patients and Cold Syndrome patients to be 
locally distributed as three groups with some overlap on the plot. 

We were more interested in whether there were features of the 
tongue-coating microbiomes that were significantly different 
between patients with the two Syndromes. We therefore focused 
on the 19 gastritis patient samples for further analysis. We used four 
types of distance measures - Jaccard, Bary-Curtis, unweighted 
Unifrac and weighted Unifrac - to obtained the dissimilarity mat- 
rices of the samples and applied AMOVA to these matrices to check 
whether there was significant stratification between the Cold and 
Hot Syndrome patients. As a result, we found significant differences 
between 9 Cold Syndrome samples and 10 Hot Syndrome samples 
within the Jaccard and unweighted Unifrac distance matrices 
(P=0.027, Fig. 3C; P=0.043, Fig. 3D). This finding indicates that 
there are systemic differences in the tongue-coating microbial com- 
position between the two TCM-defined groups. We also checked 
other factors that could group the patients, including gender and 
Helicobacter pylori positivity, using the AMOVA method. In contrast 
to the significant differences found between the two Syndromes, no 
significant difference was detected between male (n=8) and female 
(n=10) cases or between Hp-positive (n=6) and -negative cases 
(n=13) (Figs. 3Cand3D). 

Cold and hot microbiota of the tongue-coating microbiome. The 

above analyses showed that there is information in the microbial 
composition that can discriminate Hot Syndrome from Cold 
Syndrome. We therefore adopted a supervised analysis approach 
to further study the phylotypes that could distinguish the tongue- 
coating microbiomes of the Cold and Hot Syndrome patients. We 
employed the DEGseq tool 40 , which was originally designed to find 
differentially expressed genes with RNA-seq read counts. First, we 
compared the sequence abundances of all of the 6,103 species-level 
OTUs between the normal controls and gastritis patients. A total of 
755 species-level OTUs showed significant differential abundance 
between the two groups, with p-value<0.001 (FDR<0.005). The 
OTUs that were represented in fewer than three samples were then 
screened, which included 40 OTUs. Among the remaining 715 
OTUs, 483 were abundant in study patients, and 232 were 
abundant in controls. Among the 483 abundant OTUs in study 
patients, we identified 381 that were significantly different between 
patients with Cold and Hot Syndromes by comparing their 
abundances in the two patient groups. A total of 123 OTUs were 
highly enriched in Cold Syndrome patients, and 258 OTUs were 
highly enriched in Hot Syndrome patients. These "Cold OTUs" 
and "Hot OTUs represented the "Cold Microbiota" and "Hot 
Microbiota", respectively. The other 102 OTUs that distinguished 
patients from controls did not show significant differences between 
patients with Cold and Hot Syndromes; these are referred to as 
"Shared OTUs". The 123 Cold OTUs were classified to taxonomy 
by the GAST classifier 33 . Among them, 83 Cold OTUs were classified 
into 24 genera, including 28 Cold OTUs more precisely classified into 
16 species. Similarly, among the 258 Hot OTUs, 170 were classified 
into 50 genera, including 57 Hot OTUs classified into 37 species. 
Some of the genera and species identified in the Cold and Hot OTUs 
overlapped. As shown in Fig. 4, we constructed two co-occurrence 
networks with a Pearson Correlation coefficient > 0.85 of 83 genus- 
classifiable OTUs in the Cold Microbiota and 170 genus-classifiable 
OTUs in the Hot Microbiota, illustrating potential relationships 
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Figure 3 | 16S rRNA diversity analysis detected significant differences in the tongue-coating microbiome between gastritis patients with Cold and Hot 
Syndromes. (A) Alpha diversity rarefaction curves of all 27 samples based on species-level OTUs. (B) PCoA plot of the between-sample Jaccard matrix of 
all 27 samples. ( C) AM OVA results of the between-sample Jaccard distance matrix of the 1 9 gastritis patients. The F-statistic is the ratio of the mean square 
among groups within the groups. (D) AMOVA results of the between-sample unweighted Unifrac distance matrix of the 19 gastritis patients. 
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Figure 4 | Tongue microbiota-imbalanced networks of genus-classifiable OTUs in (A) the Cold Microbiota and (B) the Hot Microbiota. OTUs as 
nodes are marked with the genus names (some with more precise species names), and edges with Pearson's correlation coefficient (PCC) > 0.85 are 
indicated. 
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Table 2 


Summary of functions of Cold OTUs 






#OTUs 


Taxonomy Assignment 


Function 


References 


o 
o 


. ... 
Lactobacillus sp. 

Lactobacillus pontis 

Lactobacillus salivarius 

Streptococcus sp. 

0 1 1 cUI ULOL.LU i UllUlllUoUo 


lactic acid bacteria 


A ~7 
4/ 




Lactobacillus solivorius 


, . .. 
probiotics 


4ft 




Bifidobacterium pseuclocatenulatum 


AO 
AY 




Veillonella sp. 


metabolizing lactic acid 


ou 




Parascardovia denticolens 


dental caries 


O 1 




Porphyromonas endodontalis 


periodontical diseases (gingivitis or periodontitis) 


^9 
JZ 




Actinomyces oris (lowly-abundant) 




CO 

JO 




Dialister invisus 


endodontic infections 


RA 




Ol 1 u 1 IICWCJI II 1 in oUlellcb 




•J -J 


, 


ricVUIfcJIItJ Ul 15 


UULIfcM ClIHU UMU cIlUUCUIUIMo 


JU 




Leptotrichia trevisanii 




57 




Abiotrophia defectiva 




58 




Shuttleworthia satelles 




59 




Actinomyces odontolyticus 




60 




Peptostreptococcus sp. 


infection in/on all body sites 


61 




Aerococcus urinae 


urinary tract infection 


62 



among the two groups of microbiota presented in patients with Cold 
and Hot Syndromes, respectively. In total, the 251 Cold or Hot OTUs 
were classified into 61 genera, including 85 Cold or Hot OTUs more 
precisely classified into 49 species (Table S3). 

Discussion 

In the long history of traditional clinical practice in China and other 
Eastern countries, TCM practitioners have typically classified 
patients of the same disease into sub-groups as different 
Syndromes from a holistic perspective on patients' overall status. 
Many features are used in the discrimination of Syndromes, but 
the tongue-coating appearance is one of the major factors. It has 
been understood that the human microbiome is related to human 
health status and many diseases 41,42 . We hypothesized that the 
tongue-coating appearance may reflect characteristics of the tongue 
microbiome, which is associated with the status of the human body. 
To test this hypothesis, we conducted a next-generation sequencing 
analysis of tongue-coating microbiome samples from 9 Cold 
Syndrome patients with a white-greasy tongue coating, 10 Hot 
Syndrome patients with a yellow-dense tongue coating, and 8 
healthy cases with a normal tongue coating. We designed an 
Illumina paired-end with double-barcode (Ipedb) protocol to 
sequence the 16S rRNA V6 regions of the microbiome, which 
was more accurate and had deeper coverage than existing methods. 

Compared with the available healthy tongue dorsa microbiomes 43 
we found that although different 16S rRNA library construction 
techniques and data analysis methods were used, the results from 
our normal control samples and those of previous studies 43 46 
showed consistent phyla catalogues, namely Firmicutes, Bacte- 
roidetes, Proteobacteria, Fusobacteria, and Actinobacteria, as the 
main parts of the tongue microbiome. Moreover, both our study 
and others reported abundance information 44,46 showing that 
Firmicutes, Proteobacteria and Bacteroidetes were the most abun- 
dant phyla, followed by Fusobacteria and Actinobacteria in 
healthy tongue dorsa microbiomes. The only difference is the 
increased level of Actinobacteria and decreased level of Bacte- 
roidetes in three samples 44 compared with our results. The com- 
parison results demonstrate the reliability and consistency of our 
study in terms of healthy tongue microbiome. 

Statistical analysis of the obtained sequences demonstrated the 
potential connection between the tongue-coating microbiome and 
traditional tongue diagnosis. Two subtypes of tongue-coating micro- 
biomes were identified in gastritis patients who correspond well to 



the Cold and Hot Syndromes. We identified 715 species-level OTUs 
in the tongue-coating microbiome of chronic atrophic gastritis 
patients who were different in abundance compared to the controls 
and were further classified into Cold, Hot or Shared OTUs. A total of 
251 of these OTUs were assigned to 61 genera and 49 species, imply- 
ing the potential of using these 61 genera and 49 species as biomar- 
kers for the status of patients (Table S4). Because the oral cavity is at 
the forefront of the alimentary and respiratory tracts, it is reasonable 
that the tongue-coating microbiome might reflect the status of the 
gastrointestinal and respiratory ecosystems. 

We further explored the function of the OTUs and the possible 
relationship between the Cold and Hot OTUs and body symptoms in 
human diseases by studying these genera or species in the literature; 
the functions of Cold OTUs are summarised in Table 2, and those of 
Hot OTUs are in Table 3. Interestingly, the functional role of Hot 
OTUs can clearly explain Hot Syndrome. Hot OTUs are closely 
associated with Hot Syndrome, namely, the well-accepted concept 
" Shang-Huo " in Chinese, a condition manifested as inflammation or 
irritability in some parts of body. For example, 4 Hot OTUs identified 
as Alloprevotella tannerae and Solobacterium moorei were respons- 
ible for halitosis or bad breath in the oral cavity 79,80 . Six Hot OTUs 
identified as Cory neb acterium argentoratense were regarded as a 
pathogen of tonsillitis 81 . Twenty Hot OTUs were identified as 
Prevotella nigrescens, Capnocytophaga sputigena, and others, which 
are pathogens of periodontal diseases or highly abundant in patients 
suffering from periodontal disease (see references in Table 3). In 
contrast, 1 Cold OTU was identified as Actinomyces oris, which 
was not highly abundant in periodontitis patients 53 . Only 2 Cold 
OTUs identified as Porphyromonas endodontalis have been reported 
as possibly pathogenic in periodontitis 52 . Therefore, tongue-micro- 
biome analysis provides new findings that effectively explain Hot 
Syndrome in traditional Chinese medicine. 

As shown in Table 2 and Table 3, the potential functions of the 
Cold and Hot Microbiota show some consistency with the metabol- 
ism-immune imbalance underlying Cold and Hot Syndrome as 
revealed in our previous studies 4,5 . For example, three dominant 
genera in the gastric microbiome of gastric cancer patients, namely, 
Lactobacillus, Veillonella and Streptococcus, exhibited different pat- 
terns in Cold and Hot OTUs. In addition, some Hot OTUs were 
assigned to Solobacterium moorei, Corynebacterium argentoratense, 
Prevotella nigrescens, and other species. Pathological studies indicate 
that these bacteria are associated with TCM Hot Syndrome-related 
inflammation or irritability such as halitosis, tonsil infection, and 



SCIENTIFIC REPORTS | 2 : 936 | DOI: 1 0. 1 038/srep00936 



8 



Table 3 


Summary of functions of Hot OTUs 






#OTUs 


Taxonomy Assignment 


Function 


References 


i 


Lactobacillus sp. 


lactic acid bacteria 


A7 


7 


Streptococcus sp. 






1 

1 


Streptococcus thermophilus 






1 

1 


Streptococcus minor 






1 

1 


Vagococcus sp. 








rVlUCiUU lUUUo 5U. 


UcMIUI LUI lei 


AT 

uo 


1 
1 


Pseudoramibacter alactolyticus 




A4 
04 


1 
1 


Scardovia inopinata 




Oj 


1 
1 


Treponema amylovorum 




OO 




P ro\/f"ifei /"i n i n rocron c 

1 ItJVUItNIU r 1 1 y 1 CoLCl lo 


t~\ar\r~ir\r\r\\\rTi\ r~\ i con co c Ifimnivitic or r^on/~iH/~\ntil , ici 

UtSI IUUUI II H_U 1 Ul JCU JCo IU 1 1 ILJ 1 Y 1 1 1 O KJ\ UC 1 IUUUI 1 1 1 1 1 Oj 


67 


2 


^UUIIULVIUUi IUUU bUUIIUcllU 


vJO 


l 

i 


Desulfomicrobium orale 






i 


Treponema socranskii 




/u 


o 


r UoUUULIcl 1 U 1 1 1 bU. 




71 


o 
z 


Porphyromonas g i ng iva lis 




79 "7T 

/ z,/ o 


1 

1 


Streptococcus sanguinis 






1 
1 


Tannerella forsythia 






2 


riGVUIeJIItJ 1 1 1 lei 1 1 IfcJU 1 U 






o 
z 


^Aogibacterium sp. 






1 

1 


Olsenella sp. 






1 


1 1 1 /^i 1 1 c f o r nnoi i m ncm fa c 
L/IUMolGl UilcUIIlUolillcb 




/ -J 


n 
z 


Parvimonas micra 




7A 
/O 


O 


Capnocytophaga sp. 


1 ill 

dental plaque 


77 


z 


Capnocytophaga sputigena 






1 
I 


Capnocytophaga haemolytica 






1 


Rothia aeria 


i i- i 
degrading gluten 


7Q 

/o 


o 


Rothia mucilaginosa 






o 
z 


Alloprevotella tannerae 


halitosis 


70 

/ y 


o 
z 


Solobacterium moorei 




Ov 


(5 


i /~ir\/nor\nfYori i im n rnonfnrnfonco 
v \j\ y i itsut_n_itsi iuiii uiljciiiuiuio i jc 


tnii c i II i fi c 

IUI lOl II 1 1 1 o 


81 


9 
z 


Prevotella oris 


bacteremia and endocarditis 


JO 


i 


Cardiobacterium hominis 




82 


i 


Gemella sanguinis 




83 


i 


Capnocytophaga haemolytica 




84 


2 


Actinomyces odontolyticus 




60 


2 


Peptostreptococcus sp. 


infection in/on all body sites 


61 


1 


Agrobacterium sp. 


opportunistic infections 


85 



periodontal inflammation (Table 3). Each group of Cold or Hot 
OTUs also has a potential symbiotic relationship and forms a micro - 
biota-imbalanced network (Fig. 4). Taken together, we conclude that 
the abnormal appearances of the tongue coating as observed by TCM 
tongue diagnosis actually reflect the imbalanced tongue-coating 
microbiome in the entire human ecosystem. Therefore, this is the 
first study to suggest that the TCM diagnosis of Cold and Hot 
Syndromes based on tongue-coating appearance successfully detects 
signals in the tongue-coating microbiomes that are indicators of the 
entire body's status. 

Other available factors, such as gender and Helicobacter pylori 
infection, did not significantly affect the microbiome features. 
Although it had been well established that the bacterium 
Helicobacter pylori is closely related to gastritis and gastric ulcers 86 
and that Helicobacter pylori could be detected in the saliva and 
supragingival and subgingival plaque of periodontitis patients 87 , we 
did not detect the bacterium on tongue coatings of either 
Helicobacter pylori-positive or -negative samples. This result is con- 
sistent with a recent study that found that the oral cavity is not a 
reservoir for Helicobacter pylori in patients with epigastric pain 
syndrome 88 . 

We acknowledge that there is a limitation in the present work in 
that the age of normal controls did not match that of study patients, 
and more investigation into the impact of age on the tongue micro- 
biome is required in future work. Nevertheless, we believe that the 
major conclusions of the current study are still valid because of the 
following: first, youth are more likely to form a healthy tongue 



coating and thus serve as a good control against abnormal tongue 
appearances; second, our results for normal controls are in accord- 
ance with a healthy tongue dorsa microbiome for people between 18 
and 40 years of age 46 ; and third, both the sex and the ages of patients 
with Cold and Hot Syndromes were well matched in this study. 

In summary, by studying the tongue-coating microbiomes of 
chronic atrophic gastritis patients and healthy controls using next- 
generation sequencing and bioinformatics analysis, we revealed the 
possible mechanism underlying the traditional tongue diagnosis and 
highlighted the potential for using tongue coating microbiome fea- 
tures as biomarkers to reflect the holistic status of the human body, 
especially in gastrointestinal diseases. The current sample size of this 
study is still small, and further investigations with more samples and 
complete metagenome sequencing are needed for a comprehensive 
understanding of the system. The observations described here might 
provide a new direction for understanding the biological basis of 
TCM tongue diagnosis and the linkbetween the "geographic tongue" 
and the human body's health status observed in modern Western 
medicine. 

Methods 

Tongue-coating samples. This study was approved by the Medical Ethical 
Committee of the Beijing Dongzhimen TCM Hospital. All patients and healthy 
volunteers signed informed consent forms. The 19 gastritis patients were enrolled 
from Beijing Dongzhimen TCM Hospital and Xiyuan TCM Hospital; 9 of them were 
diagnosed with Cold Syndrome, and 10 of them were diagnosed with Hot Syndrome. 
Nine normal control samples were drawn from healthy volunteers with no stomach 
discomfort at Tsinghua University. All healthy volunteers reported no stomach 
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discomfort in the past 5 years and exhibited a normal tongue coating as judged by 
TCM doctors. Samples with medical histories that included the use of glucocorticoids 
and antibiotics and a history of smoking within the last 3 months were excluded. All 
patients were tested for Helicobacter pylori infection using the fast urea test and 
histology, and all healthy volunteers were tested using the 13C urea breath test. All 
tongue coatings were sampled and photographed in the morning prior to breakfast, 
and then the patients were examined by gastric endoscopy. All participants were 
required to brush their teeth before the sampling to rinse possible food contamination 
that might influence the tongue microbiome. One of the nine normal control samples 
was withdrawn from the study due to an abnormal passing rate (—30%) in data 
preprocessing, resulting in a final study group of 8 controls and 19 study patients. 

Tongue-imaging analysis. We used the DS01-B tongue manifestation acquisition 
instrument by DAOSH Co., Shanghai, China, to photograph the tongue coatings of 
patients and healthy volunteers (Fig. IB). A 400X400-pixel image was cut from the 
centre of each tongue coating image, where the "spleen-stomach" conditions is 
reflected in the traditional tongue diagnosis, by a certified and experienced TCM 
doctor. The area is regarded to be associated with conditions of "spleen-stomach" in 
the TCM tongue diagnosis. A total of 18 digital features were extracted from every 
400 X 400-pixel image in two steps. First, we listed nine colour variables in three 
common three-dimensional colour spaces for each pixel. These variables are hue, 
saturation and brightness in the HSB colour space; x, y and Y in the CIExyY colour 
space; and L, a and b in the CIELab colour space. Second, we summarised the mean 
and the variance of nine colour variables among 400X400 pixels, resulting in 18 
digital features of the image. Finally, Principal Component Analysis (PCA) was 
performed on the 18 digital features of the 27 images. 

16S rRNA sequencing. DNA samples were extracted using Qiagen extraction kits 
(Qiagen, Hilden, Germany). Isolated DNA was used as a template for the 
amplification of the V6 region of 16S rRNA. All samples were amplified in 50 ul 
PrimerStar HS Premix (Takara, Dalian, China) that contained 2.5 ul template DNA 
and a final primer concentration of 0.4 uM. In the first round of PCR amplification for 
extracting V6 regions, the following parameters were used: 94° C for 5 min; then 16 
cycles of 94 D C for 45 sec, 52°C for 45 sec, 72°C for 60 sec, and, finally, 72 °C for 
10 min. In the second round of PCR amplification to introduce Illumina sequencing 
adapters, the following parameters were used: 94°C for 5 min; then 16 cycles of 94"C 
for 45 sec, 56°C for 45 sec, 72°C for 60 sec; and, finally, 72 "C for 10 min. Products 
were cleaned up with the QIAquick PCR purification kit (Qiagen, Hilden, Germany). 
Eight healthy samples, 9 Cold Syndrome samples and 10 Hot Syndrome samples were 
then sequenced on one flow cell (seven lanes and one phiX control lane) with a read 
length of 2X 120 bp using Illumina's Genome Analyzer IIx and the Ipedb protocol 
that we developed. Image analysis and base calling were completed using Real Time 
Analysis (RTA) vl.8. The details of the Ipedb protocol are described in the 
Supplementary Materials. 

Data preprocessing. Data preprocessing was carried out in a six-step pipeline. (1) 
Raw paired-end reads were removed if they contained ambiguous base calling N(s). 
(2) The remaining paired-end reads were assembled into sequences by overlapping 
their forward and reverse ends. (3) We trimmed barcodes from assembled sequences 
and used them to assign sequences to different samples. In this step, sequences 
survived if their barcodes at both ends indicated the same samples. (4) We trimmed 
linkers and primers from sequences to obtain V6 tags. (5) V6 tags with low-quality 
scores were screened. The quality score of each base in a sequence was the larger that 
in the forward end and the reverse end. The minimum score of all bases in the V6 tags 
was >Q30, and the average score of all bases in the V6 tags was >Q38. (6) UCHIME 89 
was used to detect chimeras from high-quality V6 tags. Finally, we acquired high- 
quality, non-chimeric V6 tags from raw paired-end reads. 

Sequencing data analysis and statistical approaches. These high-quality, non- 
chimeric sequences were assigned to taxonomy by the GAST rRNA classifier 33 and 
were clustered into OTUs at different genetic distances (0%, 3%, 5% and 10%) by 
CROP 34 , an unsupervised Bayesian clustering algorithm. We then used Mothur 90 to 
implement a series of OTU-based analyses of the alpha and beta diversities of the 
microbial communities. In the alpha diversity analysis, three richness estimators - the 
observed OTU number, Chaol 91 and ACE 92 - and two evenness estimators - the 
Shannon Index 93 and Simpson Index 94 - were calculated (Fig. S4). Rarefaction curves 
of observed OTU numbers on sampling depths were drawn with 10,000 sequences per 
step (Fig. 2). In the beta diversity analysis, a Jaccard membership dissimilarity 
estimator 35 and a Bray-Curtis structure dissimilarity estimator 36 were calculated. The 
phylogenetic- based Unifrac distance 38 was also analyzed. Principle Coordinate 
Analysis, or PCoA 3,9 was used to visualise the beta diversity. In addition, AMOVA 37 
was carried out by Mothur 90 to test the hypothesis that dissimilarities between 
samples from the same groups are significantly different compared to those from 
different groups. For each species-level OTU, a negative binomial test was performed 
using DEGSeq 40 to test whether the OTU was differentially abundant between healthy 
controls and chronic atrophic gastritis patients. 
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Cold Syndrome 

Pale complexion- 
White-greasy tongue 

Preference to warm or 
hot food and drinks 

Abdominal distension 
and being relieved 
with fomentation 

Clear abundant urine 

Cold limbs and 
loose stool 

Aversion to cold and 
preference to warmth 




If 



Hot Syndrome 

- Fever 
Dysphoria 

Yellow-dense tongue 
Halitosis 

Preference to cold food 
and drinks 

Burning sensation in 
stomach 



Yellow urine 
Constipation 

Aversion to warmth and 
preference to cold 
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Figure 1 | (A) An ancient instruction for tongue diagnosis recorded in Shang-han-dian-dian-jin, a TCM book compiled in the Ming Dynasty of China 
(1368-1644 AD). (B) Symptoms all over human body used as features for the TCM classification of Cold and Hot Syndromes; the tongue-coating feature 
is highly important in clinical practice. (C) Sampling images of tongue coating from the centre of the tongue, an area regarded as reflecting conditions of 
the "spleen-stomach" in the traditional tongue diagnosis. (D) Typical tongue-coating appearances: normal tongue coating of healthy control examples 
(upper), white-greasy tongue coating of Cold Syndrome examples (middle), and yellow-dense tongue coating of Hot Syndrome examples (lower). 
(E) Principal Component Analysis (PCA) of the tongue-coating images. The healthy controls, Cold Syndrome patients and Hot Syndrome patients are 
distributed in separate regions, indicating that the three classes are distinguishable based on their tongue-coating images. 
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Figure 3 | (A) Alpha diversity rarefaction curves of all 27 samples based on species-level OTUs. (B) PCoA plot of the between-sample Jaccard matrix of all 
27 samples. (C) AMOVA results of the between-sample Jaccard distance matrix of the 19 gastritis patients. The F-statistic is the ratio of the mean square 
among groups within the groups. (D) AMOVA results of the between-sample unweighted Unifrac distance matrix of the 19 gastritis patients. 
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